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This note examines a problem in enumerative and asymptotic combinatorics involving the 
classical structure of integer compositions. What is sought is an analysis on average and in 
distribution of the length of the longest run of consecutive equal parts in a composition of size 
n. The problem was recently posed by Herbert Wilf (see arXiv: 0906.5196). 



A composition of an integer n is a sequence [x\, . . . , x rn ) of positive integers such that 

n = X\ + • • • + x m , and Xi > 1. 

The Xi are called the parts and n is the size of the composition. We wish to know the length of the 
longest run of equal parts (which we denote by the random variable L) in a random composition of 
size n. For instance, the composition 

3,2,1,4,4,4,4,4,7,3,5,5,4,2, 

has L — 5. A composition with L = 1 is known as a Carlitz composition. The characteristics of 
Carlitz compositions and their generating function C^(z) (see Proposition [TJ are studied in great 
detail in 01 E]. The solution to the longest run problem can be broken down into four main sections. 
In the first section, we find a family of generating functions for integer compositions that keeps track 
of the longest run of equal parts. In the second section, we analyze the generating functions using 
singularity analysis to find an asymptotic estimate of the number of compositions of size n with 
no run of length k. In the third, we use that estimate to describe the probability distribution of 
the random variable L, and in the fourth, we calculate the mean and variance of the distribution. 
The analysis here has some similarities to the analytic treatment of compositions in [TJ|4l[5], and 
the methods and notation used in this note are detailed in the book Analytic Combinatorics by 
Flajolet and Sedgewick [3] . This note was motivated by a question of Wilf, posed at the Analysis 
of Algorithms 09 Conference (Frejus, June 2009); see [5J. 
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1 Enumerative Aspects of Compositions 



The enumeration of integer compositions is easily solved using basic combinatorics. We can create 
a graphical model of a composition by representing the integers in unary using small discs ( "•" ) 
and drawing bars between some of the balls. The following is an example using the composition 
2 + 3+1 + 1 + 3= 10: 



Using this "balls-and-bars" model, we see that the number of compositions of the integer n is 

since a composition can be viewed as the placement of separation bars at a subset of the n — 1 
spaces between the balls. 

We can also find the enumeration of compositions with the symbolic method [3J p. 40]. If 
the integers are represented in unary, then the combinatorial class of positive integers (X) can be 
thought of as a sequence of atoms (Z) so that 

1=Seq> 1 (Z) => I(z) = 



1 - z 



Since an integer composition is simply a sequence of positive integers, we can easily derive the 
generating function for the class C of compositions from the specification 

C = Seq(X) C(z)=-^— = -^ = ^. 

1 - I(z) 1 - — z l — 2z 

Throughout this note, we let [z n ]f(z) be the coefficient of z n in the expansion of f(z) at 0: 

/«***=/». 

n 

We find that our result using the symbolic method is consistent with the above combinatorial 
argument, since 

\z n ]C(z) = \z n }— \z n }^^- = 2" - 2"- 1 = 2 n -\ 

1 1 W L J 1 - 2z 1 1 1 - 2z 

Now that we have a generating function for all integer compositions, we need another gener- 
ating function for compositions, which keeps track of the longest run of equal parts. We begin by 
examining Smirnov words, i.e., words over an m-ary alphabet such that no letter occurs twice in a 
row. Words over the m-ary alphabet {ai, . . . , a m } can be represented by the multivariate generating 
function 

W{xi, . . .,x m ) = ■ ■ r, 

l-[X 1 -\ h X m ) 

where Xj marks the number of times the letter aj occurs in a word. That is, the expression 
[x" 1 , . . . ,x^"]W(xi, . . . ,x m ) denotes the number of words in which the letter a\ occurs ni times, 
et2 occurs n.2 times, and so on. 

Similarly, let S(yi, . . . ,y m ) be the generating function for Smirnov words, where yj marks the 
number of times the letter a,j occurs in a word. Now, given a Smirnov word, one can obtain any 



2 



word by replacing aj with a nonempty sequence of a,j (i.e., a,j x Seq (fflj)). In terms of generating 
functions, this translates to 



W(xi, . . .,x m ) = S 



Xi 



1 — Xi 1 — x r 



We use this to find the generating function for Smirnov words in terms of the generating function 
for all words: 

Vl Vm 



1 + Vl ' ' ' 1 + Vn 



S( yi ,...,y m ) = W 
so that we have 

/ m 



S(yi, ...,y m )= 1 - X! J 



We would like to generalize S(yi, . . . ,y m ) to the generating function S^(yi, . . . ,y m ) for words 
over an m-ary alphabet such that no letter occurs k times in a row. We can obtain this via the 
substitution 

k-l k 

which yields 



S^( yi ,...,y m )= U-Et 




Integer compositions are sequences of positive integers, and hence can be thought of as "words" 
over an infinite alphabet where the "letters" are the positive integers. So, by letting m tend to 
infinity and replacing yj with z 3 , we obtain the generating function for integer compositions with 
no part appearing k times in a row. 

Proposition 1 The generating function for integer compositions with no part appearing k times 
consecutively is given by 

; 1 ~ ^Cfc-D 



"I 

oo 



j : 



1 - zi k 



We have now finished the first step of the solution, which is to find a family of generating 
functions for integer compositions, indexed by their longest run of equal parts. Our result is 
essentially equivalent to one given by Wilf (Theorem 3 of [6 ). Wilf obtains it by means of the 
correlation theory of Guibas-Odlyzko. For similar problems, Knopfmacher and Prodinger employ 
the technique of "adding a slice" [1]. Our derivation above, based on Smirnov words, has the 
advantage of great versatility, and it follows [3l p. 205]. 
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2 Singularity Analysis 



We move on to the second step of the solution, which is to view (z) as a function in the complex 
plane and perform singularity analysis. This step is much more technical than the first. The 
important fact used here is that information about the function's Taylor coefficients is contained in 
the asymptotic behavior of the function at its singularities. In fact, most of the relevant information 
is contained in the dominant singularity^] Thus, our next step is to find the dominant singularity 
of the function C^ k \z) and show that it is an isolated pole. This is established in the following two 
propositions. The analysis here is reminiscent of that of longest runs in binary words in [31 p. 308]. 
However, the case of integer compositions is somewhat more complicated because wc do not have 
rational generating functions, and hence encounter additional difficulties in finding the dominant 
singularity of C^{z). 

Proposition 2 The dominant singularity of the generating function C' fe ' (z) satisfies 

Pk = \ (1 + 2- fe - 2 + 0{k2' 2k )) , fc^oo. 



Proof. We consider the denominator of C^(z): 



DC 



i -j(fc-l) °° °° oo r j(k-l) 
1 _ V z j - - = 1 - V + V z j - V z> - 

Z_< I _ z jk Z-^i Z-^i Z-^i I _ z jk 

Let h(z) — "Yli2>\ z ^ anc ^ = 12j>i z ^ k l-zik ■ Let P k ^ e ^ ne dominant positive singularity of 
C^(z), whose existence is guaranteed by Pringsheim's Theorem j3J p. 240]. The value p k is the 
solution to the equation 

h(z) - g(z) = 1. 

Since h(z) = z/(l — z), we have that h(l/2) = 1. Also, g(z) is positive when z is real between 
and 1. Therefore, h(l/2) - 5(1/2) < 1. Simple calculations show that h(Q.6) - 5(0.6) > 1 for k > 2. 
Hence we have that 

1 3 

2 <Pk< ~5 

To get a closer estimate for p^, we consider that pk is the fixed point of the equation 

and use iterative methods to estimate the fixed point. We let 

1 l + gte) 
z = - and = — — -— . 

2 2 + g(zi) 



X A dominant singularity is one of smallest modulus. 
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Since 1/2 < pk, we have that z\ < pk, and by induction Zi—% < Zi < p^ for all i > 0. Since pk is 
the unique fixed point of equation ([TJ, the sequence {zi}°l must converge to pk- 
Before computing z±, it is helpful to simplify our definition of g(z): 

Now, we have that 

_ 1 + . 9 (l/2) _ 1 + 2- fc - 1 (l - 2- fe )- 1 + 0(2- 2k ) 
Zl ~ 2 + .9(1/2) ~ 2 + 2- fc - 1 (l -2- fe )-! + (9(2- 2fe ) 

= 1 + 2" fc - 3 + (9(2- 2fc ) 

Further iterations will increase our estimate by adding terms which are 0(k2~ 2k ), so we have that 
p k -zi = 0{k2- 2k ). Therefore, 

Pk = zi + 0(k2- 2k ) = \(1 + 2- k - 2 + 0(k2~ 2k )) , 
as desired. ■ 



Proposition 3 For k > 4, t/ie uaZwe is the only singularity of the function C' fe ' (z) m the domain 
\z\ < 3/5. 



Proof. Let 



and let 



Notice that 



f(z) = 1 — h(z), where h(z) 



g{z) = Y. z3k Y- 



C {k) {z) 



j">i 
1 



Z-? A 



l-h(z)+g(z) f(z) + g(zy 

Thus, once we have shown that \g(z)\ < |/(z)| for all z on the circle |z| = 3/5, then by Rouche's 
Theorem [3 p. 270], f(z) and f(z)+g(z) will have the same number of zeros in the domain |z| < 3/5. 
The proposition will follow immediately, since there is only one root of /(z) in that domain, which 
is z = 1/2. 

To show that \g(z)\ < \f(z)\ for all z on the circle \z\ — 3/5, we first bound \g(z)\ from above 
and then bound |/(z)| from below. On this circle, we have that 



1 



1 - z^ 



< 



< 



1 - |z^| ~ 1 -0.6 fe 



and 



\l-z j \ < l + \z j \ < 1.6, 



which implies 



i3Wi<E 



3>1 



7 -fc| 11 ~ *\ 

'll-z^l 



< 



1.6 



1 - 0.6' 



■El 

3>1 



z jfc | < 



(1.6)(0.6) fc 
(l-0.6 fc ) 2 
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For k > 4, this becomes 



We now need to bound |1 — h(z)\ from below on the circle \z\ — 3/5. That is, we need to find 
the distance between the point 1 and the image of the circle \z\ = 3/5 under the linear fractional 
transformation 

1 — z 

The image of a circle under a linear fractional transformation is again a circle. Thus, since <j)(— 0.6) = 
—0.375 and 0(0.6) = 1.5, we have that the image of {\z\ — 3/5} under <fi is the circle \z — 0.5625| = 
0.9375, which comes closest to the point 1 on the positive real axis. Therefore, 

\f(z)\ = \l-h(z)\ > |1-1.5| = 0.5. 

Therefore, for \z\ = 3/5, 

\g(z)\ < 0.274 < 0.5 < \f(z)\, 

so Rouche's Theorem can be applied. Thus there is only one root of f(z) + g(z) in the given domain, 
and that root must be p k - ■ 

Now that we have found and isolated the dominant pole, pk, we use the Residue Theorem to 
extract information about the coefficients from the behavior of C^{z) at pk- This is our main 
approximation, expressed primarily as n — > oo, but also allowing for k to get large. 

Proposition 4 The number of compositions of n with no run of k equal parts satisfies 

Ci k) = Pl n ~\l Pk) 2 (1 + e(k)) +°((|) T '). n -> °°> 
uniformly with respect to k > 4, where e(k) — 0(k2~ k ) as k — ► oo. 
Proof. The Residue Theorem gives 

1 ( >-dz = Res — ^;z = +Res — ^;z = p k 



2ni J\ z \ =3 / 5 z n+1 ' \ z n+1 ' J \ z n+1 

= C<*>+ Res * = (2) 

On the other hand, previous arguments have shown 

|1 - h(z) + g(z)\ > |1 - h(z)\ - \g(z)\ > 0.5 - 0.2737 > 0.22. 

Hence 

1 r CW(z) I f 1 dz 5 /3V"' 1 

2^i| 2 |=3/5 ^ Tl+1 Z ~ 27rii|,|=3/5 l-h(z)+g(z)z^ < tod \5 J 
Combining equations ([2| and ([3]), we obtain 

cS»-B-(^;.-») +0 ((| 
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We now need to find an estimate for the residue R n ,k '■= —Res ( z „ + V ; ; z = J • We have 
rr^ = where D fe (z) = 1 h > z Jfe r T . 

Hence 

-n-1 

p _ _£k 

A straightforward calculation gives 

D' k {z) = ^D k (z) = - 1 + (k + l)z k + kz k - x + 0(kz 2k ), 

uniformly for z near p k as k — > oo. So we have 

1 1 



D 'M - (A + l)p* - fc^ 1 + 0(fcpf ) 

= -(1 - Pfe ) 2 - (k + 1)4 ~ + 0(fc 2 pf )• 

We thus obtain 

Rn,k = Pk n ~\l - Pkf (1 + 0(k2- k )) , 

from which the result follows immediately. ■ 

This concludes the second step of the solution. In the remainder of the note, we find the 
asymptotic form of the probability distribution of the random variable L and estimate its mean 
and variance. 



3 Analysis of the Probability Distribution 

In this section we exploit the main approximations of Propositions[2]and[4]to describe the probability 
distribution of the random variable L. This development is in the scale of lg n, which is anticipated 
on probabilistic grounds (see Conclusion) . Our main goal here is to show a double exponential form 
for the bulk of the distribution. 

Theorem 1 Let L be the random variable measuring the longest run of equal parts in a random 
integer composition of size n. Then L satisfies^ 

P n (L < k) = e-<"* +a + 

uniformly for fceZ such that | lg n < k < 2 lg n. Equivalently |^] 

¥ n (L < Llgnj +h) = e-^ h - 2 (l + O (^)) , w(n) = 2^">, 



uniformly for h € Z such that — | lg n + {lg n} < h < lg n + {lg n} . 



2 In this note, P n refers to the probabilistic model where all compositions of size n are taken equally likely. 
3 Here lgn = log 2 n and {a} represents the fractional part of a (i.e., {a} = a — [a]). 
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2 4 6 8 10 12 14 16 



Figure 1: Histograms of the exact distributions P n (L) for n — 20, 40, . . . , 500. 

The formulae above do not represent a single distribution, but rather a family of distributions 
indexed by the fractional part of lg n. The second form given above shows explicitly how the 
behavior of the distribution depends on the value of n in relation to powers of 2. Figure [T] displays 
the histograms of the exact distributions of L for n = 20, 40, . . . 500. We see that the peak of the 
distribution does not increase smoothly with n, but instead incurs jumps and irregularities, which 
are a result of the distribution's dependence on {lgn}. Such a phenomenon is analogous to that 
found in the distribution of longest runs in binary words [21 p. 308]. 

Proof. Recall that the total number of compositions of size n is C n = 2 n ~ 1 . Therefore, by 
Proposition [4] 

P«(£ < k) = ^- = ^(2Pfc)- n (l " Pkf (1 + e(fc)) + (Q)") ' ( 4 ) 
where e(fc) = 0(fc2~ fe ). Note that in the region | lgn < k < 21gn, the value of (2pk)~ n satisfies 

(2p fc )-« = exp(-nlog(2p fe) ) = exp (- JJ_ + {^)) = (l + O (^)) , (5) 

and the value of the coefficient ^-(1 — p^) 2 is of the form 

A(i-^ = i + o( 2 -) = 1 + o(J 7J ). 

Hence, in this region, 



P„(i < fc) = 
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We now adjust equation Q to reflect the distribution's dependence on the placement of n with 
respect to powers of 2. The floor function is used to emphasize the fact that k must be an integer. 
Let k = lg ri + x where 16 [— | lg n, lg n]. Then we can write 

k = [lg n\ + {lg n} + x, 

and we let h = {lg n} + x. We require that h be an integer and that h — {lg n} £ [— | lg n, lg n]. In 
other words 

3 

k = [lg n\ + h. where h G Z, such that — — lgn + {lg n} < h < lg n + {lg n}. 
Inserting this into equation (JsJ) we obtain 

F n (L < [lgn\ +h) = cxp ( 2lg „_ 0g ? ; }+fe+2 ) (l + O fl0gH 



'n 



as desired. ■ 

In Theorem 1 we only considered a small central region about lgn, namely I lg n < /c < 2 lg n, 
where the bulk of the distribution is concentrated. For our subsequent analysis, we will also need 
restraints on the tails of the distribution. We do this with the following proposition. 

Proposition 5 The tails of the distribution of L are exponentially small. In particular, for k < 
| lg n, we have 

F n (L <k) = 0(e~ ^" /4 ), n -> oo, (7) 
and for k — 2 lg n + y, we have 

V n {L>2\gn + y) = o(^pj , n — oo, (8) 

uniformly for y > 0. 

In the right tail, we introduce the additional parameter of y to establish that, when k > 2 lgn, the 
probability P n (X > k) is not only exponentially small, but also uniformly decreasing as k — > oo. 

Proof. For k < | lgn, we find 

P„(L < Ife) < P„(L < | lgn) = A(i _ p,)^-™/^) ^i + o fe)) = 0(e-^/ 4 ), 

while for fc = 2 lg n + y with y > 0, we find 

¥ n (L > 2\gn + y) < 1 - P„(L < 21gn+ y) 

. 1 _l (1 _„,vr-»/ I ..)( 1+0 (i!)) 

= i-o( e -( 2 ^)/< 4 ")) 

= /_ 
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4 Mean and Variance 



Finally, we estimate the mean and variance of the distribution. As suggested by Theorem [T] the 
scale is lg n with some periodic fluctuations. The fluctuation is of a kind frequently encountered in 
the analysis of algorithms (see [2]). For instance, the equations for the mean and variance here bear 
a striking resemblance to those found by Archibald, Knopfmacher, and Prodinger in their analysis 
of the number of distinct letters in geometrically distributed sequences [T] . 

Theorem 2 The expected value of L satisfies 

M^)=l g n +i ^-f + P(lgn) + o( 1 ^ 
where P is a continuous periodic function whose Fourier expansion is given by 



6 kez\{o} v 6 



— 2ikirw 



The oscillating function P(w) has mean value and is found to have tiny fluctuations, along the 
order of 10~ 5 . 

Proof. We have 

E n (L) = ]T V n (L >k) = J2( 1 ~ r n(L < k)) . 

k>l k>l 

We evaluate the sum in three pieces: the two tails (Proposition [5]) , and the central region where 
the distribution is concentrated (Theorem flj) . For the left tail, i.e., k < | lgn, we have F n (L > k) 



l<fc<flgn l<fc<|lgn 



exponentially close to 1. More precisely, 

£ (l-P„(L<fc))= Yl (l-0(e-^ /4 )) 

fc<| lgn 

Y (1 - e-"/ 2t+2 ) + O ( e -^ 4 logn) + O (e-"/ 2fc+2 logn) 



l<fc<|lgn 



'11 



For the right tail, where k > 2 lg n, we have P n (L > k) is both exponentially small and uniformly 
decreasing as k — » oo, as shown by Proposition [5] We can therefore write 

E P n (L>k) = 0(¥ n (L>2lgn)) = o(- 

fe>21gn v 

By a similar argument, we can also show that 

E (l-e-^ k+2 )=0(l/n). (10) 

fe>21gn 
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Finally, for the central region, we have 

P n (L < fc) = < 

Hence 

J2 (1-Pn(i<*))= E ( 



1 + 



log 11 



l-e 



| len<fc<21gn 



lgn<fc<21gn 



+e-»/' w, o(!^)o(log») 



E I 1 

; lgn</c<21gn 



log n 



(11) 



Combining equations d9|, (10 I, and (111, we obtain 



E„(L) = £(l- e -/ 2fc+2 ) + 



fe>i 



log 2 n 



which can be rewritten as 



E„(L) = * (4 ) - 1 + O 



log 2 n 



where = ^(1 - e -a:/2h ). 



(12) 



ft>0 



We can now obtain precise asymptotic information about $(x) through Mellin transform tech- 
niques (see [5]). The Mellin transform of 4>(x) is 



/>oo 

$*(s) := / $(x) 
Jo 



X s dx 



1 - 2 s 



We (-1.0). 



We see that $*(s) has a double pole at s — and simple poles at s = t£^, which indicate an 
asymptotic expansion of that involves a Fourier series. We obtain 



log 2' 



7 , 1 



§{x)=\gx + ^ + - + P{\gx)+0[-), P{w) 



1 



log 2 



fcGZ\{0} 



2ikir 
log2 



-2ik7TW 



Evaluating <j>(x) at x — j and substituting into equation (12 1 gives the desired result. 
Theorem 3 The variance of L satisfies 



O 



log 4 n 



where |e| < 10 4 . 

Proof. We use the identity 



l n (L) = E n (L 2 )-E n (L) 2 , 
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and we start by computing the second moment of L. The methods are similar to those used to 
compute E„(L), and their presentation is abbreviated here. We have 



V k 2 W„(L = k) 


= y^(2fc - 1)P, 


k=l 


fc=i 


oo 

£(2fc-l)(l-l 


?n(£ < fc)) 


fe=i 




£(2*-l)(l- ( 


r"/ 2fc+2 ) + o( 








'log 3 n\ 










is given by 





/ log 3 " 



**(*) = -r(«) £(2/1 -i)2- h = --^ 



r(s)(l-3-2 s ) 



(1-2 S ) 2 ' 

h>0 y ' 

The transform has a triple pole at s = and double poles at s = for k G Z \ {0}, the singular 
expansions of which yield the following asymptotic form of ^f(x): 



g - 1 + 2P(lgz)) - | + ^ - ^ - Pfe.) + 00..) + O £ 

where P(u>) is as in the statement of Theorem [2] and Q(w) is a periodic function with Fourier 
expansion 

■) / ■) ; I,.- \ / ■) ; I - \ 

— lik-KU 



2 ^ {2ikn\ (2ikn 



We therefore have 



E„(L 2 ) = lg 2 n + lgn(-^--5 + 2P(lgr^ : ~ 



log2 7 3 log 2 2 61og 2 2 

" 7 -5P(]gn) + Q(]gn) + ofe^V (13) 



log2 vo ' ' V y/n 

Meanwhile, the square of E„(L) satisfies 



E n (L) 2 = lg 2 n + lgn — - 5 + 2P(lgn) 



log2 vo 7 4 log 2 2 



5^(lg n) + HxP(lg n) + P(lg n) 2 + O . (14) 



log 2 vo ' log 2 



12 



1.000 1,200 1,400 1,600 





1,000 1,200 1.400 1,600 



Figure 2: The functions $ (f ) - Igx - 1 (left) and * (|) + 1 — ($ (|) - l) (right), which repre- 
sent the "constant parts" in the asymptotic form of the mean and variance of L. 



Subtracting equation (14 1 from equation (13), we obtain 

^-2 



V»(L) 



1 



12 6 log 2 2 

1 7T 2 



+ Q(lgn) 



12 6 log 2 2 



O 



27 
log 2 

log 4 n 



P(lgn) - P(lgnY + o 



log 4 n 



where lei < 10 



Figure [2] displays the periodic functions associated with the mean and the variance. We see that 
the amplitudes of the fluctuations in the periodic functions are very small, about 1.6 x 1CP 6 in the 
mean and 1.5 x 1CP 5 in the variance. 



5 Conclusion 



It is well-known that the total number of summands in a random integer composition of size n is 
~ n/2 both on average and in probability. Furthermore, the number of summands equal to 1, 2, 3 . . . 
is close to f , 5, . . ., respectively [3J p. 168]. Using this elementary fact, the longest run problem 
could alternatively be approached by studying words of length n/2 and looking at the longest run 
of l's, then the longest run of 2's, and so on, and combining the results. To do so, we can make use 
of the fact that in a word of length v over the binary alphabet {a, b}, where a occurs with relative 
frequency p, the expected length of the longest run of a's is 

k = log 1/p v. 

In a random composition of size n, the summand r occurs with relative frequency approximately 
l/2 r , so we would expect the longest run of r's to be roughly (y — > n/2) 



fco(r) ~ log 2 r 



lgn 

r 



13 



Figure 3: The longest run, L r , of each integer r in four compositions of size 10 5 . The integers are 
represented in increasing order from bottom to top (the bottom line represents L\). 

This heuristic approach can be made rigorous by analytic methods. One would alter the gener- 
ating functions to record the longest run of a particular summand r in the composition, and then 
carry out analysis of the dominant pole Pfc(r). The important observation is that the exponent of ^ 
in the approximation of Pk{r) translates directly to the coefficient of lgn in the expectation of the 
longest run of r's. The technical details are entirely similar to the analysis in this note. We obtain: 

Proposition 6 Let L r be a random variable representing the longest run of r's in a composition. 
Then we have 

E„(L r ) = -lgn + 0(l). (15) 
r 

Again, the distribution is highly concentrated around the mean. We then see that L ~ L\ with 
high probability. In fact, since the length of the longest run of l's is likely to be twice that of 2's, 
and since the L r have small variance, we should expect the longest run of equal parts to almost 
certainly be a run of l's. We can even infer that the constant term of E„(Li) will coincide with 
that of E„(L), given in Theorem [2] This intuition is supported by Figure [3j which displays the 
graphs of L r for four simulations of random integer compositions of size 10° . We see that Li is 
much greater than Li in all cases. 

Equation [15] though only valid for fixed r, suggests that the largest summand (i.e., the largest 
r such that L r > 1) should be close to lgn, which is a true fact [3J p. 169]. This is also illustrated 
by Figure [3j where we see that L r is strictly positive until around r ~ lg n, when it alternates 
sporadically between and 1 until it ultimately peters out completely Such behavior suggests 
that in a random composition of size n, all summands up to lgn + O(l) will be present with high 
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probability. Meanwhile, summands at Ign + u)(n), for any uj{n) — > oo, will likely not occur at all. 
This is consistent with the analysis of Archibald, Knopfmacher, and Prodinger in [T], where the 
number of distinct values in a geometrically distributed sequence was found to be lgn + 0(1). 

In our analysis, we found a periodic fluctuation in the distribution which is common to several 
problems in analysis of algorithms. These common fluctuations are not coincidence, but rather the 
result of an underlying structure of the singularities, namely a geometric displacement of a fixed 
pole pk with respect to 1/2 (see Proposition |2| . This structure induces the same asymptotic form 
in the expectations of several values, which on the surface seem to be unrelated (see [5]). 
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